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CHIMERIC MUTATIONAL VECTORS HAVING NON - NATURAL NUCLEOTIDES 
1. FIELD OF THE INVENTION 

The invention concerns compounds and methods of their 
use to make specific genetic alterations in the genome of 
target eukaryotic cells. More specifically, the invention 
concerns the introduction into the nucleus of the target cell 
of an oligonucleobase compound, termed a Chimeric Mutational 
Vector (CMV) , having a sequence that has regions of homology 
and one or more differences with a gene of the -arget cell 
that is to be altered (the "target gene"). The structure of 
the CMV is designed so that genetic recombination between the 
CMV and the target gene occurs, i.e., the sequence of the CMV 
replaces the sequence of the target gene. 

2. BACKGROUND OF THE INVENTION 

2.1. Site-Directed Genetic Alteration in Eukaryctic Cells 

Those skilled in the art of molecular biology recognize 
that on frequent occasions it is desired not merely to 
introduce a new polynucleic acid sequence, i.e, a new gene, 
into a targeted eukaryotic cell, but rather to alter a 
defined, pre-existing gene in the targeted cell. The 
targeted cell can be used in culture or it can be used to 
construct a transgenic animal. 

A wide variety of techniques have been developed to 
introduce DNA into cultured eukaryotic cells. These 
techniques include calcium phosphate precipitation and DEAE- 
dextran mediated endocytosis, electroporation, liposome 
mediated fusion and transduction with replication incompetent 
viruses. However, while such techniques can quite often 
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introduce functional genes into the eukaryotic cell, these 
techniques do not readily accomplish an alteration (mutation) 
in a specific existing gene. After introduction the exogenous 
DNA is isolated at a random position in the cell's genome by 
illegitimate recombination, rather than at a specific 
position by homologous recombination. 

Prior to the present invention, there was no generally 
satisfactory scheme for introducing a site-specific genetic 
alteration in a higher eukaryote , i.e, in mammalian or avian 
cells. Although homologous recombination can be obtained in 
higher eukaryotic cells by introduction of very long {> 1 kb) 
nucleic acids, these techniques require the application of 
elaborate selection techniques because the rate of 
illegitimate recombination in higher eukaryotes greatly 
exceeds that of homologous recombination. Thomas, K.R. & 
Capecchi, M.R., 1987, Cell 52:503. See, also, Valancius, V. & 
Smithies O. , 1991, Mol . Cell. Biol. 11:4389 (comparison 
homologous recombination of linearized and supercoiled 
plasnids in eukaryotic cells) . 

One approach to achieving a predominantly site-directed 
mutagenesis has been the introduction of single stranded 
oligodeoxynucleotides directly into the cell. This 
techniques has been successfully employed in the yeast 
Saccharowyces cerevisiae, in which homologous recombination 
is significantly more active than it is in higher eukaryotes. 
Moerschell, R.P., et al . , 1988, Proc .Natl .Acad. Sci . 85:524- 
28; Yamamoto, T., et al . , 1992, Yeast 8:935-48. However, to 
date there have been no reports of the successful 
transformation of mammalian or avian cells by single stranded 
oligonucleotides . 

- 2 - 
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convincingly sho«n to result in a rate homologous 
recombination in any viable eufcaryotic cell significantly 
greater than the spontaneous rate of homologous 

recombination. 

2 2 -) -i^Hrt,., Having PWPtffl nnrm P« t « 

The inclusion of a publication or patent application xn 
this section is not to be understood as an admission that the 
publication or application occurred prior to the present 
invention or resulted from the conception of a person other 

than the inventor. 

An oligonucleotide having complementary 
deoxyribonucleotides and ribonucleotides and containing a 
sequence homologous to a fragment of the bacteriophage 
M13m P 19, was descried in Kmiec , E.B.. et al . . November 1994. 
Mol and Cell. Biol. 14:7163-7172. The oligonucleotide had a 
single contiguous segment of ribonucleotides. Kmiec et 
showed that the oligonucleotide was a substrate for the REC2 
homologous pairing enzyme from Ustilago maydis. 

Patent publication WO 95/15972, published June 15, 1995, 

,.„ n „ c natent application Serial No. 
and corresponding U.S. patent a Vt > ^ 

08/353,657, filed December 9, 1994, by E.B. Kmiec, described 
CMV for the introduction of genetic changes in eutaryotic 
cells Examples xn a Ustilago .aydis gene and in the murine 
ras gene were reported. The latter example was designed to 
introduce a transforming mutation into the ras gene so that 
the successful mutation of the ras gene in NIH 3T3 cells 
would cause the growth of a colony of cells 
(••transformation") . The WO 95/15972 publication reported 

rate of transformation of NIH 3T3 was less 
that the maximum rate oi "auoi. 

than 0.1 %. i-e., about 100 transformants per 10* cells 
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exposed to the ras CMV. In the Ustilago maydis system the 
rate of transf ormants was about 600 per 10 6 . A chimeric 
vector designed to introduce a mutation into a human bcl-2 
gene was described in Kmiec, E.B. February 1996, Seminars, in 
Oncology 23:188. 

A CMV designed to repair the mutation in codon 12 of K- 
ras was described in Kmiec, E.B., December 1995, Advanced 
Drug Delivery Reviews 17:333-40. The CMV was tested in Capan 
2, a cell line derived from a human pancreatic 

adenocarcinoma, using LIPOFECTIN™ to introduce the CMV into a 
the Capan 2 cells. Twenty four hours after exposure to the 
CMV, the cells were harvested and genomic DNA was extracted; 
a fragment containing codon 12 of K-ras was amplified by PCR 
and the rate of conversion estimated by hybridization with 
allele specific probes. The rate of repair was reported to 
be approximately 18%. 

A CMV designed to repair a mutation in the gene encoding 
liver/bone/kidney type alkaline phosphatase was reported in 
Yoon, K., et al . , March 1996, Proc. Natl. Acad. Sci . 93:2071. 
The alkaline phosphatase gene was transiently introduced into 
CHO cells by a plasmid. Six hours later the CMV was 
introduced. The plasmid was recovered at 24 hours after 
introduction of the CMV and analyzed. The results showed 
that approximately 30 to 38% of the alkaline phosphatase 
genes were repaired by the CMV. 

United States Patent Application Serial No. 08/640,517, 
filed May l f 1996, by E.B. Kmiec, A. Cole-Strauss and K. 
Yoon, discloses methods and CMV that are useful in the 
treatment of genetic diseases of hematopoietic cells, e.g., 
Sickle Cell Disease, Thalassemia and Gaucher Disease. 
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3. BRIEF DESCRIPTION OF THE FIGURES 

Figure 1. General form of one embodiment of a Chimeric 
Mutational Vector. 

Figures 2A and 2B. Figure 2A shows the sequence and 
structure of oligonucleotide Dhl (SEQ ID No: 18) and chimeric 
oligonucleotides Chi (SEQ ID No:15) f Ch2 (SEQ ID No:16)and 
Ch3(SZQ ID No:17). Figure 2B illustrates the relationship 
between the sequence of the chimeric vector Chi and nt 693- 
728 of the alkaline phosphatase gene (SEQ ID No: 19) . DNA 
nucleotides are UPPER CASE; RNA nucleotides are lower case. 
Figure 3. The sequences of the codons 3-9 and adjacent 
dinucieotides of codons 2 and 10 of S^-globin (nt 1-25 of SEQ 
ID No:21), S A -globin (nt 1-25 of SEQ ID No:20), 6-globin (SEQ 
ID No: 25), and chimeric vectors SC1-SC5 (SEQ ID Nos : 20-24). 
DNA and RNA nucleotides are indicated as in Figure 2A. 
Figure and 43, Figure 4A and 4B show the fraction of 
copies of S-globin converted from 6 s to fi A as a function of nM 
SCI added and of fi A to £ s as a function nM of SC2 , 
respectively, in cultures of EB- transformed lymphoblasts . 
Figure 5. Figure 5 shows the fraction of copies of S-globin 
converted fc A to S s as a function of ng SC2 added to cultures 
of cd34* hematopoietic stem cells. 

4 . DEFINITIONS 

The invention is to be understood in accordance with the 
following definitions. 

An q1 igpnUGleobfr^g is a polymer of nucleobases, which 
polymer can hybridize by Watson-Crick base pairing to a DNA 
having the complementary sequence. 

NucleobasfiP; comprise a base, which is a purine, 
pyrimidine, or a derivative or analog thereof. Nucleobases 

- 6 - 



BNSDOC1D: <WO_G748714Al_l_> 



WO 97/48714 

PCT/US97/10S38 



include peptAde nucleohflff<»p, the subunits of peptide nucleic 
acids, and morpholine nucleobases as well as nucleosides, 
nucleotoids and nucleotides. NuclensiriP. are nucleobases 
that contain a pentosef uranosyl moiety, e.g., an optionally 
substituted riboside or 2 • -deoxyriboside , and have a linkage 
to other nucleobases that does not contain a phosphorus. 
Nucleate.? dff are pentosefuranosyl containing nucleobases 
having linkages that contain phosphorus, e.g., 
phosphorothioates, phosphoramidates and methylphosphonates, 
but not phosphates. Nucleotide are pentosefuranosyl 
containing nucleobases that are linked by an unsubstituted 
phosphodiesters . 

A pUqpnucJ,eohase sh&in has a single 5 ' and 3 ' 
terminus, which are the ultimate nucleobases of the polymer. 
A particular oligonucleobase chain can contain nucleobases of 
all types. An gUgonuc^phpsp coimonnH is a compound 
comprising one or more oligonucleobase chains that are 
complementary and hybridized by Watson-Crick base pairing. 

Nucleobases are either deoxyribo-type or ribo-type. 
RAbo-tvpe nucleohaspR are pentosefuranosyl containing 
nucleobases wherein the 2' carbon is a methylene substituted 
with a hydroxyl, alkyloxy or halogen. Deoxvrihn-ryp p 
nuc^epbasep are nucleobases other than ribo-type nucleobases 
and include all nucleobases that do not contain a 
pentosefuranosyl moiety. 

An PUgpnuclPphfl^e Strand generically includes both 
oligonucleobase chains and segments or regions of 
oligonucleobase chains. An oligonucleobase strand has a 3' 
end and a 5 1 end. When a oligonucleobase strand is 
coextensive with a chain, the 3- and 5 • ends of the strand 
are also 3' and 5' termini of the chain. 
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A region is a portion of an oligonucleobase , the 
sequence of which is derived from some particular source, 
e.g., a CMV having a region of at least 15 nucleotides having 
the sequence of a fragment of the human B-globin gene. A 
segment is a portion of a CMV having some characteristic 
structural feature. A given segment or a given region can 
contain both 2 1 -deoxynucleotides and ribonucleotides. 
However, a ri bo- type segment or a 2 ' -dep^yribo- type segment 
contain only ribo-type and 2 ' -deoxyribo-type nucleobases 
respectively. 

5 . SUMMARY 

The present invention provides oligonucleobase 
compounds, termed Chimeric Mutational Vectors (CMV) . CMV can 
be used to introduce specific genetic changes in plant and 
animal cells. The invention is applicable to in the field of 
medicine for gene therapy, and in the fields of biomedical 
research, pharmaceutical production and agriculture to 
construct specifically mutated plants and animals. A CMV 
comprises two complementary oligonucleobase strands. The two 
strands can be present on a single chain or on two chains 
that can be optionally linked by any chemistry for cross- 
linking oligonucleotides. 

The sequence of the strands of a CMV is homologous to 
the target gene except for a mutator region which introduces 
the genetic change into the target gene. The CMV can also 
contain regions having a sequence unrelated to the target 
g ene< The mutator region must be directly adjacent in both 
3 1 and 5 1 directions to a homology region of at least one 
base. 
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The oligonucleobases of the CMV are either ribo-type or 
2 1 -deoxyribo-type. Ribo-type nucleobases contain a 
pentosefuranosyl moiety having a 2' oxygen or halogen. At 
least three contiguous bases of the homologous region of the 
first strand are ribo-type nucleobases that are Watson-Crick 
base paired to deoxyribo-type nucleobases of the second 
strand. Nucleobases that are sensitive to RNase are not 
preferred for use in the invention. 

.6. DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides compounds, termed 
Chimeric Mutational Vectors (CMV) that can be used to make 
specific changes in the genome of a eukaryotic cell. The CMV 
are comprised of polymers of purines and pyrimidines that 
hybridize, i.e.. form a Watson-Crick base pairs of purines 
and pyrimidines, to DNA having the appropriate sequence. 
Each CMV is divided into a first and a second strand of at 
least 15 bases each that are complementary to each other and 
can be, but need not be, covalently linked. The polymers of 
purines and pyrimidines, termed oligonucleobases, are 
comprised of two types of subunits, termed nucleobases. 
There are two types of nucleobases. Ribo-type nucleobases 
are ribonucleosides having a 2'hydroxyl, substituted hydroxyl 
or 2 'halo-substituted ribose. All nucleobases other than 
ribo-type nucleobases are deoxyribo-type nucleobases. 

The sequence of the first and second strands consists of 
at least two regions that are homologous to the target gene 
and one or more regions (the "mutator regions") that differ 
from the target gene and introduce the genetic change into 
the target gene. The mutator region is directly adjacent to 
homologous regions in both the 3 1 and 5 ' directions. In a 

- 9 - 
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preferred embodiment of the invention, each mutator region is 
adjacent in both the 3' and 5* directions to homologous 
regions of at least three ba9es. In a preferred embodiment 
of the invention, each mutator region is flanked in both the 
3' and 5* directions by ribo-type oligonucleobase segments of 
at least three bases, which segments need not be adjacent to 
the mutator region. The flanking ribo-type nucleobase 
segments need not be directly adjacent to the mutator region, 
i.e, a portion of the homologous region comprising deoxyribo- 
type nucleobases can intervene. The total length of all 
homologous regions is preferably at least 14 bases. If the 
CMV contains two homologous regions separated by a mutator 
region, the homologous regions can more preferably be each 
between 8 and 12 bases long and most preferably be 10 bases 
long . 

At least two homologous regions of the first strand are 
comprised of at least three contiguous ribo-type nucleobases 
which are Watson-Crick paired to deoxyribo-type nucleobases 
of the second strand. In a preferred embodiment there are 
between 9 and 2 5 ribo-type nucleobases and more preferably 2 0 
ribo-type nucleobases in the first strand, which are Watson- 
Crick paired to deoxyribo-type nucleobases of the second 
strand. In one embodiment there are no ribo-type nucleobases 
in the second strand. In one embodiment the mutator region 
of the first strand consists of deoxyribo-type nucleobases 
and is flanked by deoxyribo-type nucleobases. Alternatively, 
the mutator region can be comprised of ribo-type nucleobases 
of the first strand and deoxyribo-type nucleobases of the 
second strand. 

The CMV is still further characterized by containing at 
least three nuclease resistant ribo-type nucleobases. In a 

- 10 - 
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preferred embodiment all ribo-type nucleobases are nuclease 
resistant . 

The mutator region can be as large as 2 kilobases and 
can encode an exon. Preferably the mutator region consists 
of 2 0 or fewer bases, more preferably 6 or fewer bases and 
most preferably 3 or fewer bases. The mutator region can be 
of a length different than the length of the sequence that 
separates the regions of the target gene homologous with the 
r.onclogous regions of the CMV so that an insertion or 
deleticr. of the target gene results. When the CMV is used to 
introduce a deletion there is no base identifiable as within 
the mutator region. Rather, the mutation is effected by the 
juxtaposition of the two homologous regions that are 
separated in the target gene. In one embodiment the mutator 
region is a deletion of from 6 to 1 bases or more preferably 
from 3 to 1 bases. Multiple separated mutations can be 
introduced by a single CMV, in which case there are multiple 
mutator regions in the same CMV. Alternatively multiple CMV 
can be use simultaneously to introduce multiple genetic 
changes in a single gene or, alternatively to introduce 
genetic changes in multiple genes of the same cell. 

In a preferred embodiment the CMV is resistant to RNase. 
Therefore, the use of only naturally occurring ribo-type 
nucleobases, which are sensitive to RNase, is not suitable 
for use in the invention. The ribo-type nucleobases of a CMV 
should include at least three ribo-type nucleobases 
preferably selected from non-phosphodiester linked 
ribonucleotides ( "ribonucleotoids" ) , 2 ' O-substituted or 
2 ' haloribonucleotides, 2 ' O-substituted or 
2 "haloribonucleotoids, and ribonucleosides , which can be 
optionally 2' substituted. In a preferred embodiment of the 

- 11 - 
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CMV, no nuclease sensitive, i.e., 2 'O-ribonucleotides, are 
used . 

In one embodiment the CMV is a single oligonucleobase 
chain of between 40 and 100 bases. In an alternative 
embodiment, the CMV comprises a first and a second 
oligonucleobase chain, each of between 20 and 100 bases; 
wherein the first chain comprises the first strand and the 
second chain comprises the second strand. The first and 
second chains can be linked covalently or, alternatively, can 
be associated only by Watson-Crick base pairings. 

6.1. USES OF CHIMERIC MUTATIONAL VECTORS 

Chimeric Mutational Vectors can be used to introduce 
changes into the sequence any eukaryotic gene, which has a 
known sequence. The change can result in the replacement of 
one or more nucleotides or can be an insertion or deletion of 
one or more nucleotides. In preferred embodiments the 
replacement, insertion or deletion can be of 20 or fewer 
contiguous bases, in a more preferred embodiment the 
replacement, insertion or deletion can be of 6 or fewer bases 
and most preferably of 3 or fewer bases. The insertion can 
be as long as about two kilobases. The insertions or 
deletions can be made in the coding and the regulatory parts 
of the gene. 

Cells can be transfected with the CMV by any technique 
now known or to be developed for transfecting cells with DNA. 
Such techniques include electroporation, liposome transfer 
and calcium phosphate precipitation. In one embodiment the 
transfection is performed with a liposomal transfer compound, 
e.g., DOTAP (N- 1 1 - (2 , 3 -Dioleoyloxy) propyl] -N, N, N- 
trimethylammonium methyl sulfate, Boehringer-Mannheim) or an 
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equivalent, such as LIPOFECTTN. The amount of CMV is not 
critical to the practice of the invention; good results can 
be achieved with 10 nM/10- cells. A ratio of about 500 ng of 
CMV in 3 /ig of DOTAP per 10 5 cells can be used. The 
transfection technique of Examples 1-3, infra., can be used 
with the modification that the transfected cells are cultured 
in serum free media, media supplemented with human serum 
albumin or human serum. 

The present invention encompasses methods of using 
Chimeric Mutational Vectors. Uses of Chimeric Mutational 
Vectors include the repair of genetic diseases such as 
Gaucher Disease, thalassemia and Sickle Cell disease. Other 
applications include the introduction of stop codons or frame 
shift mutations to make "knock-outs," i.e., transgenic 
animals or plants that lack a functional copy of a specific 
gene, as well as transgenic animals or plants having specific 
mutations. In a still further method of use encompassed by 
the invention specific mutations can be made for the purpose 
of studying the structure function relationships of genes-of- 
interest. Alternatively, if a desirable mutation has been 
identified in one species, it can be introduced in the 
homologous genes of other species by use of the CMV. 

For medical purposes, the invention can be used to 
repair mutations or introduce mutations into any cell-type 
that can be removed from a subject's body, cultured and 
reimplanted into the subject. Techniques for the removal, 
culture and reimplantation of hepatocytes, in particular of 
hepatic reserve (stem) cells, have been described in patent 
publication WO94/08598 to Naughton, G.B. and Sibanda, B., 
April 28, 1994. Examples of genetic diseases that can be 
cured by repair of mutations in hepatocytes include: familial 
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hypercholesteremia, caused by mutation in the LDL receptor; 
emphysema that caused by a mutation in the al -antitrypsin 
gene; and hemophilia and Christmas Disease, which are caused 
by mutations in coagulation factors VIII and IX, 
respectively. 

In a yet further use of the invention, CMV can be used 
to mutageni2e a population of cells so that a mutant, having 
a selectable phenotype can be obtained. According tc this 
aspect cf the invention, a mixture of CMV having a mutator 
regicn cf one or several nucleotides is synthesized so that 
the three non-wild type nucleotides are present at each 
position of the mutator region. The treatment of a 
population of cells with such a mixture of CMV will induce a 
variety cf mutations in the target gene. After an 
appropriate selection step, a mutant having the desired 
phenotype can be recovered. 

6.2. STRUCTURE OF EXEMPLARY CHIMERIC MUTATIONAL 

VECTORS 

6.2.1. SINGLE CHAIN CHIMERIC MUTATIONAL VECTORS 

In one embodiment a Chimeric Mutational Vector (CMV) is 
a single 5' ,3' -linked oligonucleobase chain comprised of 
between about 4 0 and about 100 pentosef uranosyl containing 
nucleobases. A single chain CMV can contain unpaired 
nucleotides, which form one or two hair-pin turns, which turn 
or turns divide (s) the CMV into the first and second strands, 
so that at least 15 bases of the first strand can be Watson- 
Crick paired to bases of the second strand. 

Figure 1 shows the structure of one embodiment of a 
single-chaim CMV with segments (a) - (h) . In the embodiment of 
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Figure 1, the first strand consists of segments (c) , (d) and 
(e) , which is complementary to a second strand consisting of 
segment (a) . In this particular embodiment the 3 • terminus 
of the CMV is illustrated to be at the 3- end of the (a) 
segment and the 5 • terminus is shown to be at the 5 ' end of 
the (h) segment. However, the location of the termini and 
the orientation of 3 • and 5 ' directions of the CMV with 
respect to the segments can be elsewhere, so long as the 
termini do not interrupt the homologous or mutator regions of 
the first or second strands. The segments are labeled 
sequentially (a) through (h) . 

In one embodiment the lengths and characteristics of the 
segments are as follows. Segment (a) is between 16 and 40 
nucleotides and preferably between 20 and 3 0 nucleotides. 
The sequence of the region of segment (a) can be either that 
of the coding strand or the non-coding strand of the gene 
that contains the intended mutation (the "mutated target 
gene"). The location of the sequence of segment (a) must 
include the portion of the target gene that is to be changed. 
Unless the target gene is not normally transcribed in target 
cell, it is preferred that the sequence of segment (a) is the 
sequence of the coding strand of the target gene. When the 
target gene is not transcribed in the target cell, then 
neither the coding strand sequence or the non- coding strand 
sequence is preferred. The sequence of segment (a) 
determines the sequences and combined lengths of segments 
(c)-(e), which must be complementary to segment (a). 

The oligonucleobases of the portion of segment (a) that 
are base paired with segments (c) and (e) can be any 2'- 
deoxyribo-type nucleobases The nucleobase of segments (c) 
and (e), which are termed ribonucleotide segments, can be any 
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ribo-type nucleobase 2' O-ribonucleotides , i.e., nucleotides 
that is known or will be developed. In a preferred 
embodiment, the nucleotides of segment (d) , which is termed 
the intervening segment are 2 1 -deoxyribo- type nucleobases . 
Alternatively, segment (d) can be made of ribo-type 
nucleobases; in which case the boundaries between segments 
(c) , (d) and (e) are not defined. Segments (b) and (f ) 
through (h) can be of any type of nucleobase. 

In a preferred embodiment, the sequence of segments (c) 
and (e) are completely homologous to the target gene. 
However, a one base mutator region in the (c) or (e) segment 
can result in the mutation of the target gene at the 
homologous position. 

Segments (b) and (g) are about 4 nucleotides in length 
and form single stranded hairpin turns that allow segments 
(a) and (c) - (e) and segments (f) and (h) to form Watson-Crick 
base pairs, i.e., to form duplex nucleic acids. In 
alternative embodiments the function of segments (b) and (c) , 
which is to covalently link the first and second strands, can 
be served by non-oligonucleobase moieties. 

Segments (c) and (e) , also termed the first and second 
ribo-type segments, consist, in one embodiment, of 2 ' -0- 
methylribonucleotides . In a preferred embodiment, segments 
(c) and (e) are independently, between 6 and 13 nucleotides. 

Segment (d) , also termed the intervening segment, in one 
embodiment , is between 4 and 20 nucleotides in length. If 
the target gene contains two or more point mutations that are 
separated by fewer than 15 nucleotides, each can be repaired 
by the same CMV. 

Segments (f) and (h) form a duplex that brings the 3' 
and 5' ends of the CMV, which is nicked between segments (a) 
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and (h) into juxtaposition. The structure formed by segments 
(f ) , (g) and (h) is termed a hairpin cap. A hairpin cap 
contains a terminal end and a nonterminal end. The terminal 
end forms a terminus of the chain, which can be either a 5' 
or a 3 1 terminus. The function of a hairpin cap is to 
control the position of the 3' or 5 1 terminus. The non- 
terminal end of the hairpin cap can be ligated to an end of a 
strand, whereby the end of the complementary strand, which is 
the second terminus of the chain is juxtaposed to the 
terminal end of the hairpin cap, as shown in figure 1. The 
3* and 5' termini can be, in one embodiment, 

dephosphorylated . In an alternative embodiment, the 3' and 
5' termini can be covalently linked by a phosphodiester bond 
or equivalent, so that the CMV is a closed circular 
oligonucleotide. Segments (f) and (h) can be optionally 
deleted from a closed circular CMV. In a preferred 
embodiment the orientation of the oligonucleobase of the 
hairpin cap is the same as the orientation of the strand to 
which it is linked. If the orientation of the hairpin cap is 
anti -parallel to the orientation of the strand to which it is 
attached, the designation as 3' or 5' of the terminal end of 
the hairpin is determined by the structure of terminal end of 
the complementary. 

In a preferred embodiment, the CMV is a single chain CMV 
containing one hairpin cap, oriented in parallel with the 
chain, having juxtaposed 3' and 5' ends. There are eight 
specific embodiments of this type, which are defined by the 
location of the ligation between the hairpin cap and the 
chain and by whether the sequence of the first strand is the 
sequence of the coding strand or the non-coding strand of the 
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target gene. The eight species are given in Table I . Figure 
1 illustrates species 2 and 6 of Table I. 



TABLE I 



Species 


Location of l.iaation of 




Hairpin Can and Chain 


1 


3 * First Strand 


Ceding 
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5' First Strand 


Coding 


3 


3 ' Second Strand 


Coding 


4 


5 * Second Strand 


Coding 


5 


3 ' First Strand 


Non-coding 


6 


5* First Strand 


Non- coding 


1 


3 ■ Second Strand 


Non- coding 


8 


5 • Second Strand 


Non- coding 



6.2.2. DUAL CHAIN CHIMERIC MUTATIONAL VECTORS 

Alternatively, the CMV can comprise two chains, each 
chain having a 3' and a 5' terminus, wherein a first chain 
comprises the first strand and a second chain comprises a 
second strand. The first and second chains can be cross - 
linked by a covalent linker or the first and second chains 
can be held in association only by Watson-Crick bases 
pairing. The lengths of the regions and segments of the 
first and second strands of a two chain CMV are constructed 
according to the foregoing guides regarding single chain CMV. 
In one embodiment, the first and second chains can further 
comprise complementary segments of between 3 and 10 bases 
that flank the first and second strands that increase the 
stability of the association between the first and second 
chains. 
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An alternative embodiment of the two chain CMV can 
comprise two oligonucleobase chains and two hairpin caps; the 
first strand is a part of the first chain and the second 
strand is a part of the second chain. The hairpin caps can 
both be ligated to the ends of one strand. In one specific 
configuration, termed a "cradle" configuration, a hairpin cap 
is ligated to each end of the second strand. In an "anti- 
cradle" configuration, the hairpin caps are ligated to the 
ends of the first strand. An alternative type of 
configuration, termed "head-to- tail , " consists of a hairpin 
cap ligated to each of the strands. Because the strands of 
the CMV hybridize only in an antiparallel fashion, there are 
only two specific configurations of the head-to-tail type: 
the hairpin caps can both be ligated to either the 3' ends or 
the 5' ends of the strands. 

6.3. THE SYNTHESIS OF CMV AND SELECTION OF NUCLEOBASES 
CMV can be synthesized by any technique used to 
synthesize oligonucleotides or oligonucleotide analogs. For 
CMV having chains up to about 100 bases in length, the 
preferred technique is solid-phase synthesis. Alternatively, 
subsegments of CMV chains greater than about 50 bases in 
length can be synthesized by solid phase synthesis and 
ligated by liquid phases techniques, well known to those 
skilled in the art. Wosnick, M.A. , 1987, Gene 60:115-117. 
As those skilled in the art appreciate, complementary 
subsegments can be synthesized by solid-phase techniques so 
that when annealed, the ends of the subsegments are 
staggered. By causing adjacent subsegments to have 
complementary staggered ends, adjacent segments can be 
ligated by well known enzymatic processes. By this technique 
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a chain of a CMV considerably larger than 100 bases can be 
synthesized. 

The nucleobases of a chain of a CMV can be any 
nucleobase now known or to be developed that hybridizes by 
Watson-Crick base pairing to DNA. Suitable nucleobases 
include nucleotides and nucleotoids. The structure and 
synthesis of oligonucleobases having exemplary nucleotoids 
can be found as follows: Phosphorothioates , Eckstein, F. , 
Ann. Rev. Biochem. , 1985, 54, 367; Phosphoramidates , 
Froehler, B.C., et al . , Nucleic Acid Research, .1988, 16, 
4831; Methylphosphonates, Miller, P.S., et al . , 1985, 
Biochimie, 1985, 67, 769. A method of producing 
oligonucleotoids having chiral-specif ic phosphorus -containing 
linkages is described in U.S. Pat. No. 5,212,295. Chirally- 
specific oligonucleotides having the appropriately selected 
isomer hybridize to DNA with improved stability. 

Pentosefuranosyl containing nucleobases that are linked 
by non- phosphorus nucleobases that can be used as deoxyribo- 
type nucleobases, which are termed nucleosides. Nucleosides 
that form duplexes with DNA that are at least as stable as 
DNA /DNA duplexes are linked by the various linkage 
chemistries. The chemistries and methods for their use in 
oligonucleobases are described as follows: 
methyl hydroxy 1 amine linkages, Vasseur et al . , J. Am. Chem. 
Soc. 1992, 114, 4006, U.S. Pat. Nos . 5,386,023 and 5,489,677; 
alkylene dioxy linkages, U.S. Pat. No. 5,223,618; and 
3 1 -Thioformacetal , Jones et al . , J. Org. Chem. 1993, 58, 
2983 . 

Other nucleosides that can be used in CMV include: 
Carbamates, Stirchak et el., J. Org. Chem. 1987, 52, 4202; 
Sulfonate U Sulfonamide, Glemarec et al . , Tetrahedron 1993, 
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49, 2287, Reynolds et al . , J. Org. Chem. 1992 , 57, 2983; 
Sulfone, Huang, Z., J. Org. Chem. 1991, 56, 3869; Sulfamate, 
Huie, E.M., et al . , J. Org. Chem., 1992, 57, 4569; and 
Diisopropylsilyl & silyl, Cormier and Ogilvie, Nucleic Acids 
Res. 1988, 16, 4583, Ogilvie & Cormier, Tetrahedron Lett. 
1985, 26, 4159. 

Pentosef uranosyl containing nucleobases can be of either 
the ribo-type or 2 ' -deoxyribo- type . At least three ribo-type 
nucleobases for use in CMV must be nuclease resistant. 
Suitable nuclease resistent ribo-type nucleobases can be 
selected from the group of ribonuclease resistant 
nucleobases, consisting of 2 ' AX-nucleosides , 2'AX- 
nucleotoids, 2 'AR-nucleotides, where A= 0, F, CI or Br and 
when A=0, then X=H or C y . 6 and R=C : _ 6 , or when A is a halogen, 
then X and R are omitted. 

Nucleobases that lack a pentosef uranosyl moiety can be 
used as deoxyribo -type nucleobases. Suitable examples 
include replacement of the pentosef uranosyl phosphate moiety 
by a Morpholino carbamate, Wang & Weller, Tetrahedron Lett., 
1991, 32, 7385 and peptide nucleic acids in which the 
pentosef uranosyl phosphate moiety is replaced by an 
aminoethylglycine . Peptide nucleic acids (PNA) are described 
in Egholm et al . , J. Am. Chem. Soc . , 1992, 114, 1895 and 
Huang, B.S. et al . , J. Org. Chem., 1991, 56, 5006 and WO 
92/20703 to Buchardt et al . ; methods of making 
PBA/oligonucleotide chimeric polymers is described in WO 
95/14706. 

Those skilled in the art understand that PNA can 
hybridize to DNA in either orientation, i.e., either end of a 
PNA can be the 3 ' or 5 ' end. Peffer, N.J., et al . , 1993, 
Proc. Natl. Acad. Sci . 90:10648-52. When a peptide 
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nucleobase is present in an oligonucleobase strand having 
pentosefuranosyl -containing nucleobases, the 3' and 5' ends 
of the strand are determined by the orientation of the 
pentosefuranosyl moieties or, if none are present in the 
chain having the peptide nucleobase, then the 3' and 5' ends 
of the strand are determined by the orientation of 
pentosefuranosyl nucleobases of the complementary strand. 
Note, that the first strand of a CMV must contain at least 
three pentosefuranosyl nucleobases. 

7 . EXAMPLES 

Example 7.1. THE USE OF CMV TO REPAIR EPISOMAL ALKALINE 
PHOSPHATASE 

An expression plasmid containing the wild- type human 
liver/bone/kidney alkaline phosphatase cDNA under the control 
of the SV4 0 early promoter was obtained and designated pHAP . 
An identical plasmid with the mutant version of the cDNA was 
obtained and designated p7ll. The design of CMV to 
interconvert the sequences of pHAP and p711 are diagrammed in 
Figure 2A. The CMVChl was designed to repair the missense 
mutation at position 711. It has a G residue, the wild- type 
sequence, at the site corresponding to the mutation. Ch2 has 
a design identical to Chi except for an A instead of G at the 
site corresponding to position 711. Ch3 has the same 
sequence as Chi but the sequence of the ribonucleotide 
segments is that of the coding strand of the alkaline 
phosphate gene instead of the non-coding strand. The 
oligonucleotide Dhl contained the same sequence as Chi, but 
contained only 2 ' -deoxynucleotides . 

The schematic of p71l in Figure 2B shows the single 
point mutation, A, at position 711 in the coding region of 
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the alkaline phosphatase cDNA, the SV40 early promoter (P E ) , 
SV40 origin of replication (ori) , polyadenylation addition 
site and small -t intron sequence for splicing (SV4 0 poly A) . 
The dotted box in Figure 2B indicates the sequence from 
PBR322 encoding the origin of replication and 3-lactamase 
(Amp") gene. CHO cells were transfected with p7li and 6 h 
later the CMV, Chi, was introduced to CHO cells previously 
trar.srecred with p71l. Both transf ections were performed 
with lipofectin. The extent of the conversion to the wild- 
type phenotype was monitored at both biochemical and DNA 
sequer.re levels, by spectrophotometry measurement, 
rus-cchemical staining and analysis of Hirt DNA. 

MATERIALS AND METHODS 

Synthesis and purification of oligonucleotides: The 
chimeric oligonucleotides were synthesized on a 0.2 /zmole 
scale using the 1000 A wide pore CPG on the ABI 3 94 DNA/RNA 
synthesizer. The exocyclic amine groups of DNA 
phosphoramidites (Applied Biosystems, Foster City, CA) are 
protected with benzoyl for adenine and cytidine and 
isobutyryl for guanine. The 2 • -O-methyl RNA phosphoramidites 
(Glen Research, Sterling, VA) are protected with 
phenoxyacetyl group for adenine, dimethyl formamidine for 
guanine and isobutyryl for cytidine. After the synthesis was 
complete, the base-protecting groups were removed by heating 
in ethanol : concentrated ammonium hydroxide (1:3) for 20 h at 
55°C. The crude oligonucleotide sample was mixed with 7M 
urea and 10% glycerol, heated to 70°C and loaded on a 10% 
polyacrylamide gel containing 7M urea. After gel 
electrophoresis, DNA bands were visualized by UV shadowing, 
dissected from the gel, crushed and eluted overnight in TE 
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buffer (10 mM Tris-HCl and 1 mM EDTA, pK 7.5) with shaking. 
The eluent containing gel pieces was spun through .45 spin 
filter (Millipore, Bedford, MA) and precipitated with 
ethanol. Samples were further desalted by G-25 spin column 
(Boehringer Mannheim, Indianapolis, IN) and greater than 95% 
of the purified oligonucleotides were found to be full 
length . 

Transient transfection and histochemical staining: GHO 
cells were maintained in DMEM (B.R.L., Bethesda, MD) 
containing 10% FBS (B.R.L. Bethesda, MD) . Transient 
transfection was carried out by addition of 10 fig of 
lipofectin in 1ml of OPTIMEM and added to each well. He 
alkaline phosphatase activity was measured 24h after 
transfection of the oligonucleotide. For histochemical 
staining, the cells were washed with 0.15 M NaCl three times, 
incubated with staining solution for 20 rnin and fixed with 
50% ethanol . The staining solution consisted of 2mg Fast 
Violet, 2ml Naphtol AS-MX phosphate alkaline solution (Sigma 
Chemical Company, St. Louis, MO) in 50 ml of water. 

Spec trophotome trie measurement of alkaline phosphate 
activity: Transient transfection was carried out in 
triplicates by addition of 1 /ig of the plasmid p71l with Ijig 
of lipofectin in 100 al of OPTIMEM (B.R.L. Bethesda, MD) to 
lxlO 4 CHO cells in a 96-well place. After 6h, various 
amounts of Chi or other CMV were mixed with 1 ^9 of 
lipofectin in 100 fil of OPTIMEM and added to each well. 
After 18h, the medium was aspirated and 200 /zl of DMEM 
containing 10% FBS was added to each well. The alkaline 
phosphate activity was measured 24h after transfection of 

. 24 - 



9748714A1 I > 



WO 97/48714 



PCT/US97/10538 



chimeric oligonucleotides. Spectrophotometry measurement 
was carried out by the Elisa Amplification System (B.R.L, 
Bethesda, MD) . Cells were washed with 0.15 M NaCl three times 
and lysed in 100 M l of NP40 buffer containing 10 mM NaCl , 
0.5% NP40, 3mM MgCl2 and 10 mM Tris-HCl pH 7 . 5 . A fraction 
of cell lysates (20 M l) was incubated with 50 M l of Elisa 
substrate and 50 M l of Elisa amplifier (B.R.L. Bethesda, MD) , 
the reaction was stopped by addition of 50 nl of 0.3 M H ,S0 4 
after 5 min of incubation with amplifier. The extent of 
reaction was carried out within the linear range of the 
detection method. The absorbance was read by an Elisa Plate 
Reader (B.R.L. Bethesda, MD) at a wavelength of 490 nm. 

Hirt DNA isolation, colony hybridization and direct DNA 
sequencing of PCR fragment: The cells were harvested for 
vector DNA isolation by a modified alkaline lysis procedure 
24 h after transfection with the chimeric oligonucleotide. 
The cells were detached by trypsinization, washed, and 
resuspended in 100 „1 of a solution containing 50mM Tris-HCl 
PH 8.0, 10mM EDTA and 110 „l of a solution containing 50 mM 
Tri-HCl pH 8.0, 10 mM EDTA and 100 pg/ml of RNase A. An 
equal volume of cell lysis solution (0.2N NaOH and 1% SDS) 
was added, followed by 100 M l of neutralization solution (3M 
KAc, pH 5.5). A 10-min room temperature incubation was 
followed by centrifugation of 10,000 rpm for 10 min. The 
supernatant was extracted with an equal volume of phenol - 
chloroform and precipitated with ethanol . Hirt DNA was 
transformed into E. coli DH5or cells (B.R.L. Bethesda, MD) . 
Colonies from Hirt DNA were screened for specific 
hybridization for each probe designed to distinguish the 
point mutation. Colonies were grown on ampicillin plates, 
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lifted onto nitrocellulose filter paper in duplicate, and 
processed for colony hybridization. The blots were 
hybridized to 32 P-end-labelled oligonucleotide probes, 711-A 
{ 5 ' - CCGCCTAC&CCCACTCG- 3 ' (SEQ ID NO : 1 ) ) or 711-G (5 ' - 
CCGCCTACGCCCACTCG-3 * (SEQ ID NO : 2 ) ) at 37°C in solution 
containing 5x Denhardts, 1% SDS, 2x SSC and 100 ^g/ml 
denatured salmon sperm DNA. Blots were washed az 52 °C in 
7 MAC solution (3.0M tetramethylammonium chloride/ 50 mM Tris- 
HC1 , pH e.C, 2mM EDTA and 0.1% SDS) . Plasmid DNA was made 
fror twenty colonies shown to hybridize to either 711-G or 
711-A, using the Qiagen miniprep kit (Chatworth, CA) . 
Several hundred bases flanking the 711 position of each 
plasmid were sequenced in both direction by automatic 
sequencing { ABI 373A, Applied Biosystem, Foster City, CA) . A 
190 bp PCR- amplified fragment was generated by Vent R 
polymerase (New England Biolabs, Beverly, MA) utilizing two 
primers ( 5 ' -CAATGTCCCTGATGTTATGCA-3 1 (SEQ ID NO: 3) and 5'- 
CGCTGGGCCAAGGACGCT-3 ' (SEQ IDN0:4)), corresponding to 
position 63 0-650 and 803-822 of the alkaline phosphatase cDNA 
flanking the 711 position. The fragment was gel-purified and 
subjected to automatic DNA sequencing (ABI 373A, Applied 
Biosystem, Foster City, CA) . 

Oligonucleotide stability measurement: Ten ng cf the 32 P-end- 
labelled oligonucleotide was mixed with 500 ng cf the 
unlabelled oligonucleotide and transfected as described 
above. In order to reduce a nonspecific binding of 
oligonucleotides, cells were washed extensively with PBS and 
a solution containing 1 M NaCl/HAc pH 2,5. A crude lysate 
was prepared by lysing the cells in a solution containing 
lOmM Tris-HCl pH 7.5, 0.5 mM MgCl 2 and 0.5% Triton x-100 
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followed by phenol -chloroform extraction. Lysates were 
analyzed by 15% polyacrylamide gel containing 7M urea 
followed by autoradiography. Oligonucleotides incubated in 
DMEM containing 10% FBS were processed and analyzed in the 
same manner . 

In our experimental design, various chimeric 
oligonucleotides were introduced into CHO cells previously 
transfected with P 7ii. The extent of the conversion to the 
wild-type phenotype was monitored by histochemical staining; 
red pigment was deposited on the cells expressing an active 
enzyme. When cells with he mutant gene were transfected with 
Chi, red cells appeared at a frequency, of approximately one 
in three transfected CHO cells, on the average, at 11 nM. In 
contrast, neither Ch2 nor Dhl caused an increased enzymatic 
activity. Conversion to wild- type was observed at a low 
level when cells were transfected with Ch3. The transfection 
frequency measured by the expression of the wild-type plasmid 
pHAP was estimated to be 30%. 

The enzymatic activity was also measured by 
spectrophotometric method described above. A dose-dependent 
increase of alkaline phosphatase activity was observed up to 
17 nM of Chi in the presence of p7ii plasmid. The enzymatic 
activity of cells treated with Chi at 17nM approached 60% of 
that observed from cells transfected with the wild-type 
plasmid, pHAP. The increase was sequence-specific since the 
same amount of Chi did not affect enzymatic activity of cells 
transfected with pHAP. Furthermore, Ch2 which contained a 
single base pair change from the Chi sequence did not cause 
any increase in enzymatic activity. The oligonucleotide, 
Dhl, which contained the same sequence as Chi, but did not 
contain a ribonucleotide segment, did not exhibit an 
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increase. Thus, spectrophotometry measurements of alkaline 
phosphatase activity were consistent with the result from the 
histochemical staining . 

Correction of a point mutation of the targeted DNA sequence 
by the chimeric oligonucleotide: In order to confirm the 
change at the DNA sequence level, a Hirt extract was made 
from the cells transfected with the p711 and various 
oligonucleotides by a modified alkaline lysis procedure, 
Wang. G, et al . , 1995, Mol . Cell. Biol. 15, 1759, 24 h after 
transfection of the chimeric oligonucleotide. Hirt DNA 
transformed DH5a cells efficiently, resulting in 10 4 Amp" 
colonies from 10 6 transfected CHO cells. DH5a transf ormants 
were screened for specific hybridization with a probe 
designed to distinguish between the point mutation (A) and 
the wild-type (G) sequence, corresponding to position 703-719 
of mutant and normal cDNAs, respectively, Weiss, MJ., 2986, 
Proc. Natl. Acad. Sci. 85:7666. The frequency of correction 
was measured by averaging the number of colonies hybridized 
to the 711-G or 711 -A probe using more than 500 colonies 
hybridized to the 711-G or 711-A probe using more than 500 
colonies of the multiple plates generated from at least two 
separate transfection experiments (Table I) . Similar 
frequencies of conversion were observed for two batches of 
Chi prepared by separate synthesis. Approximately 70% of the 
colonies generated from the Hirt DNA made from cells 
transfected with p7ll and Chi hybridized to the 711-A probe, 
while 30% of colonies exhibited hybridization to the 711-G 
probe (Table I) . Thus, a correction frequency of 3 0% was 
observed at llnM of Chi, reproducibly . Hybridization was 
specific and no cross-hybridization was observed between the 
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two populations. DNA sequencing was carried out with plasmid 
DNAs prepared from twenty of these colonies in both 
directions utilizing two primers (5 ' -CAATGTCCCTGATGTTATGCA-3 ' 
(SEQ ID N0:5) and 5 ' -CGCTGGGCCAAGGACGCT- 3 ' (SEQ ID N0:6)), 
corresponding to position 630-650 and 803-822 of the alkaline 
phosphatase cDNA flanking the 711 position. The sequence 
conversion was confirmed in each case and no other alteration 
in sequence was observed within several hundred of bases 
surrounding the target nucleotide. All colonies from the 
Hirt extract prepared from Ch2 or Dhl -treated cells 
hybridized tc the 711-A probe only (Table I). Some colonies 
from the Hirt extract of the Ch3 hybridized to the wild-type 
probe, but to a much lesser extent than that of the Chi 
(Table II) . These results confirmed that the differential 
alkaline phosphatase activities exhibited were due to the 
correction of the point mutation (A to G) at the DNA sequence 
level . 
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Table II. Hybridization pattern of transf ormant s 
from Hirt extract prepared from duplicate 
transf ections of the p711 plasmid and various 
oligonucleotides at 11 nM. 
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2 
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3 
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189 


0 


185 
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143 


0 


143 


0 


Dhl 


1 


217 


0 


217 


0 






180 


0 


180 


0 




2 


157 


0 


157 


0 






188 


0 


188 


0 



RecA-def icient E. coli strains used to propagate plasmid 
DNA are capable of repair and homologous pairing functions 
using episomal DNA. In order to rule out the possibility 
that the sequence conversion is mediated by E. coli, direct 
DNA sequencing of a PCR-amplif ied fragment of Hirt DNA was 
carried out. Two primers flanking the 711 position were 
utilized to generate a 190 bp fragment through the action of 
Vent R polymerase. The results indicated that position 711 
was a mixture of A (70%) and G(30%) when the Hirt DNA sample 
was made from the cells transfected cells with the 
combination of p711 and Chi. In contrast, no mixed sequence 
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was observed at position 711 when Hirt DNA was made from 
oligonucleotide Dhl . These results established clearly that 
sequence correction by the chimeric oligonucleotide occurred 
in mammalian c ells. 

Stability of chimeric oligonucleotide: The stability of the 
chimeric oligonucleotide was measured intracellularly and in 
growth medium containing 10% FBS . Ten nanogram of 
radiolabelled oligonucleotide, Chi, was added to the same 
transfection experiment in which histochemical staining and 
Hirt DNA analyses was conducted (see Materials and Methods) . 
The chimeric oligonucleotides are extremely stable. No 
detectable degradation was observed when chimeric 
oligonucleotide was incubated in growth medium containing 10% 
FBS, after 24 h incubation. Moreover, oligonucleotide 
isolated from cells did not exhibit any degradation during 
the same incubation time. Only monomers of the chimeric 
oligonucleotide were detected when isolate from cells 24 h 
after incubation. Thus, under the experimental conditions 
employed here, no end-to-end litigation of chimeric 
oligonucleotides was observed. 

Example 7.2. THE USE OF CMV TO MUTATE THE fc-GLOBIN GENE 
IN EBV- TRANSFORMED CELL LINES 

A CMV designed to repair the mutation found in Sickle 

Cell Disease 15-globin was designed SCI, Figure 3. The 

molecule was composed of DNA residues with two intervening 

blocks of ten 2 , -0-methyl RNA residues flanking a short 

stretch of five DNA residues. When the molecule was folded 

into the duplex conformation, one strand contained only DNA 

residues while the other strand contained the RNA /DNA blocks. 
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In this case, the internal sequence is complementary to the 
S s globin sequence over a stretch of 25 residues that span 
the site of the S s mutation, with the exception of a single 
base (T) which is in bold and designated with an asterisk. 
The five DNA residues flanked by RNA residues were centered 
about the mutant T residue in the S s coding sequence. A 
control chimeric oligonucleotide (SC2) was designed in the 
same manner with the exception of the base (A) designated in 
bold and with an asterisk. Genomic sequences of the S A , £ s , 
and closely-related 6-globin genes are also displayed in 
Figure 3A with the specific site of 6 n mutation printed in 
bold. 

Lymphoblastoid cells were prepared as follows. Heparin- 
treated blood was obtained from discarded clinical material 
of a patient with sickle cell disease and from one of the 
investigators who had neither history nor symptoms of the 
disease. Mononuclear cells were prepared from blood (=8 ml) 
by density gradient centrif ugation in Ficoll and infected 
with Epstein-Barr virus which had been propagated in the 
marmoset cell line B95-8 (Coriell Institute for Medical 
Research #GM07404D) . Infections were performed with addition 
of 0.1 mg leucoagglutinin PHA-L in 10 ml RPMI medium 
supplemented with 20% fetal bovine serum in a T25 flask. 
Cultures were fed twice a week starting on day 5 and were 
considered established once 60-70% of the cells remained 
viable at day 21. The 6 A and E s lymphoblastoid cells were 
maintained in RPMI medium containing 10% fetal bovine serum. 

The CMV was introduced into the above -described 
lymphoblastoid cells homozygous for the S s allele as follows. 
Cells (1 x 10 s per ml) were seeded in 1ml of medium in each 
well of a 24 -well tissue culture plate the day prior to the 
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experiment. Transf ect ions were performed by mixing chimeric 
oligonucleotides with 3 mg of DOTAP (N-[l-<2,3- 
Dioleoyloxy) propyl ] -N, N, N- trimethyl ammonium methylsulf ate , 
Boehringer-Mannheim) in 20 ml of 20 mM HEPES, pH 7.3, 
incubated at room temperature for 15 min, and added to the 
cultured cells. After 6 h the cells were harvested by 
centrifugation, washed and prepared for PCR amplification 
following the procedure of E.S. Kawasaki, PCR Protocols, Eds. 
M . A . Innis, D.H. Gelfand, J.J. Sninsky and T.J. White, ppl46- 
152, Academic Press, (1990). 

Correction of the single base mutation was assessed by 
taking advantage of well known restriction fragment length 
polymorphisms resulting from the £ s mutation, R.F. Greeves et 
al., 1981, Proc. Natl. Acad. Sci . 78:5081; J.C. Chang and 
Y.W. Kan, 1982, N. Eng. J. Med. 307:30; S.H. Orkin et al . , 
ibid., p. 32; J.T. Wilson et al . # 1982, Proc. Natl. Acad. 
Sci. 79:3628. The A to T transversion in the fi s allele 
results in the loss of a Bsu36I restriction site (CCTGAGG) . 
Thus, the £ s allele can be detected by Southern hybridization 
analysis of genomic DNA cut with Bsu36I. A 1.2 kbp Bsu36I 
DNA fragment of the S-globin gene present normally is absent 
in the 6 s allele and is replaced by a diagnostic 1.4 kbp 
fragment. When genomic DNA recovered from homozygous S s 
lymphoblastoid cells was analyzed by this procedure, the 
expected 1.4 kbp fragment was observed. However, two 
fragments were observed in DNA from cells transfected with 
the SCI CMV. The presence of the 1.2 kbp fragment in 
addition to the 1.4 kbp fragment indicates partial correction 
of the £ s allele had taken place in a dose-dependent fashion. 

To measure the efficiency of correction rapidly and 
sensitively, we adapted a PCR-based RFLP analysis. For the 
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analysis of the 6-globin sequence, the 34 5 bp PCR fragment 
was prepared by amplification from a crude cell lysate using 
primers BG02 (5 ' -TCCTAAGCCAGTGCCAGAAGA-3 1 (SEQ ID N0:7)) and 
BG05 ( 5 1 -CTATTGGTCTCCTTAAACCTG-3 1 (SEQ ID NO:8)) and Expand 
Taq polymerase (Boehringer Mannheim) . For the analysis of 
the 5-globin gene, the same cell extracts were used in 
amplification reactions with primers DG06 (5 ' - 
CTCACAAACTAATGAAACCCTGC-3 1 (SEQ ID N0:9)) and DG07 (5'- 
GAAAACAGCCCAAGGGACAG-3 ' (SEQ ID NO:10)) to generate a 335 bp 
fragment. Gels were stained with SYBR™ green (FMC 
Bioproducts) and fluorescence intensities were quantitated 
using a Molecular Dynamics fluoroimager. DNA sequencing was 
performed in both directions using an ABI 373A sequencer. 

The above primers were designed to yield a 345 bp 
fragment spanning the site of the 6 s mutation after PCR 
amplification of genomic DNA. The fragment from normal cells 
contained a Bsu36I recognition sequence and yielded fragments 
of 22 8 bp and 117 bp, while DNA from S s gene contained the 
sequence CCTGTGG and remained refractory to cutting. 
Analysis indicated that the 345 bp DNA fragment amplified 
from SCl-treated £ s cells was partially cleaved with Bsu36I, 
indicating correction of the mutation on some, but not all, 
chromosomes. A quantitative measure was obtained by 
comparing the relative intensities of the three DNA fragments 
after electrophoret ic separation and staining with the 
fluorescent dye SYBR™ green. The stained bands were imaged 
using a laser fluoroimager and the relative levels were 
calculated. Conversion efficiency was quantitated by 
scanning the cyber green-stained agarose gel with a 
fluoroimager. Experiments at doses of between 2\ 5 and 2 5.0 
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pM of SCI /10 s S s lymphoblastoid cells showed between about 
40% and 55% conversion of 15 s to 6 A (Figure 4A) . 

The frequency of introduction of the sickle mutation by 
the CMV SC2 was also determined by the methods set forth 
above. Analysis indicated that the level of correction 
exceeded 50% at the highest level of input chimeric molecule, 
25 nM, but even at 2.5 nM correction of 3.0% of S-globin genes 
was observed (Figure 4B) . 

Direct sequencing of the PCR amplified 345 bp fragment 
was performed to confirm the T to A change in the coding 
strand. In the DNA sample from 6 s cells transfected with 
chimeric molecule SCI at a higher concentration greater than 
12 nM/10* cells. Sequence analysis revealed an approximately 
equal mixture of A and T residues at the site of the 8 s 
mutation. DNA from untreated S s cells contained only T at 
that position and DNA from S* cells contained only A when 
treated with SCI. Treatment of S« cells transfected with the 
control CMV SC2 caused no change in the S-globin gene 
sequence. However, the DNA from normal cells transfected 
with SC2 was partially converted to the 15 s mutant sequence as 
evidenced by a mixture of T and A residues at the expected 
position of the sequence. 

The specificity of the action of CMV was assessed by 
sequencing the related 6-globin gene, which is more than 90% 
homologous to the S-globin gene. The S and 6 globin genes 
are identical over the 5 bp DNA core targeting region of SCI. 
Two single base differences are underlined in Figure 3. To 
determine whether SC2 altered the 6-globin gene, DNA sequence 
analysis was performed as above. The results showed that no 
alteration was introduced into the 5 -globin gene "by the SC2 
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CMV in contrast to the observed change directed by SC2 in the 
S A -globin sequence. 

Example 7.3. THE EXPERIMENTAL USE OF CMV TO 
MUTATE THE 15-GLOBIN GENE OF A 
HSC 

METHODS AND MATERIALS 

Stem cell isolation and transfection: Normal volunteers 
were given G-CSF 300 fig S.C. twice a day for five days. On 
the fourth and fifth days of G-CSF therapy they underwent a 
four hour stem cell apheresis using a COBE spectra pheresis 
machine. Mononuclear cells were prepared by density gradient 
centrif ugation on Ficoll -Hypaque (density 1.077 g/ml, 
Pharmacia) (2000 rpm, 10 min, room temperature) . The 
majority of the monocytes were removed after adherence to 
plastics (30 min, 37° C in 5% C0 2 in RPMI with 10% FBS) . 
Cells were harvested by swirling to remove cells loosely 
adherent to the plastics which were washed 3 times with PBS. 
This population was incubated with biotinylated murine anti- 
CD34 antibodies in PBS/1% BSA for 25 min at room temperature 
at a concentration of 100 x 10" cells/ml. The antibody- 
treated cells were passed over an avidin column and those 
passing through the column were then collected for analysis. 
Subsequently the column was washed with PBS and CD34* cells 
adhering to the column were recovered by squeezing the 
column. Final purities were assessed by FACS . 

Cells were re-suspended in RPMI with 10% FCS heat 
inactivated and 1 x 10 s cells/ml were plated in a 24 well 
pate with each well receiving 1 x 10 s cells. The indicated 
amounts of chimeric oligonucleotide were mixed with 3 /ig 
DOTAP in 20 jxl of 20 mM HEPES, pH 7.3. The mixture was 
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incubated on ice for 15 minutes then added to the cells. 
After 16 hr at 37° C, 5% C0 J( the cells were harvested, 
pelleted, washed with PBS and lysed with lysis buffer. 

PCR Amplification and Analyses: Genomic DNA was 
amplified for PCR by using . PC0 2 (5 1 -TCCTAAGCCAGTGCCAGAAGA-3 ' 
(SEQ ID NO: 11)) and PCO s ( 5 ' - CTATTGGTCTCCTTAAACCTG - 3 ' (SEQ ID 
NO: 12)) respectively and Expand Taq polymerase (Boehringer 
Mannheim, Indianapolis, IN) in a 50 /il reaction at 94° c for 
30 sec, 52.5° C for 30 sec, 72° C for 30 sec for 35 cycles to 
generate a 34 5 bp fragment. For the 5 locus, the 5' primer 
was 5 ' - CTCACAAACCTAATGAAACCCTGC - 3 ' (SEQ ID NO: 13) and the 3' 
primer was 5 ' -GAAAACAGCCCAAGGGACAG-3 ' (SEQ ID NO:14) at 94° C 
for 30 sec, 59 c C for 30 sec, 72° C for 30 sec for 35 cycles. 

The PCR product was digested with either Dde I or BSU36I 
restriction endonucleases (New England Biolabs, Beverly, MA) 
and loaded onto 1.2% agarose gel (IX TBE) and 
electrophoresed. The gel was stained for 20 min in 200 ml of 
IX TBE containing 1:20,000 cyber green strain (FMC, Rockland, 
ME) in the dark and quantitated by fluoroimager (Molecular 
Dynamics, Sunnyvale, CA) . The PCR product was spun through a 
Qiaquick PCR purification spin column (Qiagen, Chatsworth, 
CA) in H 3 0, dried down by vacuum to 5>1 and the 
concentration determined spectroscopically by O.D at 260 nm. 
The DNA samples (30 fig) were sequenced directly by an 
automated Applied Biosystems Model 373A DNA sequencing system 
(Applied Biosystems, Foster City, CA) . 

Synthesis and purification of oligonucleotides: The chimeric 
oligonucleotides were synthesized on a 0.2 M mole scale using 
the 1000 A wide pore CPG on the ABI 394 DNA/RNA synthesizer. 
In this construct, the exocyclic amine groups of DNA 
phosphoramidites (Applied Biosystems) are protected with 
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benzoyl for adenine and cytidine and isobutyryl for guanine. 
The 2'-0-methyl RNA phosphoramidites (Glen Research, Sterling, 
VA) are protected with phenoxyacetyl group for adenine, 
dimethyl formamidine for guanine and isobutyryl for cytidine. 
After synthesis, the base-protecting groups were removed by 
heating in ethanol : concentrated ammonium hydroxide (1:3) for 
20 h at 55° C. The crude oligonucleotides were purified by 
polymery 1 amide gel electrophoresis and the sample was mixed 
with 7tf urea and 10% glycerol, heated to 70° C and loaded on 
a 10* polyacrylamide gel containing 7M urea. After gel 
electrophoresis, DNA bands were visualized by UV shadowing, 
dissected from the gel, crushed and eluted overnight in TE 
buffer (10 mM Tris-HCl and ImM EDTA, pH 7.5) with shaking. 
The eluent containing gel pieces was spun through 0.45 /im 
spin filter (Millipore, Bedford, MA) and precipitated with 
ethanol. Samples were further desalted by G-25 spin column 
(Boehringer Mannheim) and greater than 95% of the purified 
oligonucleotides were found to be full length. 
RESULTS: The isolated CD34*-enriched population was utilized 
first in an oligonucleotide uptake experiment. The chimeric 
molecule SC2 was mixed with the liposome formulation DOTAP 
under the conditions described above except that a 
radioactive tag was placed at the 5' end of the 
oligonucleotide. Increasing amounts of labeled and unlabeled 
oligonucleotide were incubated with the liposome for 
15 minutes. The mixture was then incubated with cells for 
6 hours after which the cells were washed extensively with 
PBS to reduce nonspecific binding. The cells were then 
centrifuged and the pellet fraction was washed with 0.2 M 
glycine (pH 4.5) to eliminate any remaining nonspecific 
binding. The radioactivity in the cell pellet was determined 
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by scintillation counting. The chimeric oligonucleotide was 
absorbed by the cell in dose-dependent fashion. Since our 
experimental strategy focused on nanomolar concentrations, we 
did not extend the curve beyond 25 nM. Based on the specific 
activity of the radiolabeled chimeric oligonucleotide and 
assuming that each cell is equally receptive to 
transformation, we estimate that up to approximately 50% of 
the.CD34* cell population was transfected with the substrate. 
For each experiment, background levels were assessed by 
mixing radiolabelled chimeric molecules with the cells in the 
absence of DOTAP and this level never exceeded 0.05%. 

A population of CD34*-enriched cells containing two 
alleles with P A genotype were transfected with various 
amounts of SC2 and 3 /ig/ml of DOTAP. Genomic DNA was 
isolated 16 h after transfection as described above and the 
extent of 3 A to 3 s conversion was measured by restriction 
enzyme polymorphism and by direct DNA sequencing. Genomic 
DNA isolated from 10 s cells was subjected to PCR 
amplification generating a 345 bp fragment by using the two 
primers PC0 2 and PCO s . The 3 A -specific sequence is cleaved by 
the restriction enzyme Dde I resulting in three fragments of 
192, 108 and 4.5 base pairs, respectively, while the 3 s - 
sequence would be cleaved only once, leaving a 300 bp and a 
45 bp fragment. An increasing level of the uncut 300 bp 
fragment was observed as a function of increasing 
concentrations of SC2, indicating conversion of the 3 A to 3 s 
genotype, Figure 5. A 50% frequency of conversion was 
observed at relatively low concentrations of chimeric 
oligonucleotide (600 ng = 30 nM x 1 ml) . In contrast, no 
conversion was observed in cells treated with SCI, a chimeric 
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molecule which pairs to the 3 A site with perfect 
complementarity . 

In order to confirm the DNA sequence change (A to T) in 
the normal cells, direct DNA sequencing of the 345 bp- 
fragment was carried out. The CD34* population containing 
homozygous 3 A alleles was transfected with 23 nM SC2 as 
described. Genomic DNA was isolated, PCR-amplif ied and the 
samples subjected to automated DNA sequencing. The DNA 
sequence of 3 A alone and 3 A treated with the SCI both 
contained T. In contrast, the DNA sequence of the 3 A cells 
treated with the SC2 showed a dose -dependent conversion of T 
to A at the expected position. The SC2 CMV contains an (a) 
segment which is identical to the ceding strand of the 3- 
globin gene. The CMV designated SC5 contained an (a) segment 
identical to a fragment of the non- coding strand of the £- 
globin gene. We repeated the transf ection .experiments 
described above with SC2 and SC5 . The results, presented in 
Figure 5, show that SC5 was active, though not as active as 
SC2 , and was apparently inactive at concentrations below 
20 nM. 

Genomic DNA from 3 A cells that had been treated with SC2 
was PCR-amplif ied utilizing the two 6-globin specific 
primers, PC0 S and PC0 7 . Only wild- type 6-globin sequence was 
found, which confirmed that the SC2 CMV is 3 -globin specific. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Kmiec, Eric B. 

(ii) TITLE OF INVENTION: CHIMERIC MUTATIONAL VECTORS HAVING 
NON -NATURAL NUCLEOTIDES 

(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pennie & Edmonds 

(B) STREET: 11S5 Avenue of the Americas 
(CJ CITY: New York 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10036-2711 

tv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM : PC- DOS /MS - DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: On Even Date Herewith 

(C) CLASSIFICATION: 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 790-9090 

(B) TELEFAX: (212) 869-9741/8864 
(C> TELEX: 66141 PENNIE 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CCGCCTACAC CCACTCG 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CCGCCTACGC CCACTCG 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

' (ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
CAATGTCCCT GATGTTATGC A 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:4: 
CGCTGGGCCA AGGACGCT 
(2) INFORMATION FOR SEQ ID NO : 5 : 

tij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CAATGTCCCT GATGTTATGC A 21 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(x) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CGCTGGGCCA AGGACGCT 

16 

(2) INFORMATION FOR SEQ ID NO: 7: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TCCTAAGCCA GTGCCAGAAG A 

21 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CTATTGGTCT CCTTAAACCT G 

21 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CTCACAAACT AATGAAACCC TGC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GAAAACAGCC CAAGGGACAG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TCCTAAGCCA GTGCCAGAAG A 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CTATTGGTCT CCTTAAACCT G 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 13: 
CTCACAAACC TAATGAAACC CTGC 

24 

(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 20 base pairs 
\B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(iii MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GAAAACAGCC CAAGGGACAG 

20 



BNSDCCID: <WO 97487 14A1_L> 



- 45 - 



WO 97/48714 PCI7US97/ 10538 



(2) INFORMATION FOR SEQ ID NOilBi 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH) 68 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESSx single 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE t DNA 
(ix) FEATURES 

(A) NAME /KEYs Chi 

(B) LOCATIONS 1...68 
(D) OTHER INFORMATIONS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOtlS: 

AGCGCCGCCT ACGCCCACTC GGCTGTTTTC AGCAGCGUGG GCGTAGGCGG CGCUGCGCGT 60 
TTTCGCGC 68 

(2) INFORMATION FOR SEQ ID NO: 16: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairB 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(ix) FEATURE: 

(A) NAME /KEY : Ch2 

(B) LOCATIONS 1...68 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

AGCGCCGCCT ACACCCACTC GGCTGTTTTC AGCCGAGUGG GTGTAGGCGG CGCUGCGCGT 60 
TTTCGCGC 68 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(ix) FEATURE: 

(A) NAME /KEY : Ch3 

(B) LOCATIONS 1...6B 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GCGCGTTTTC GCGCAGCGCC GCCUACGCCC ACUCGGCUGT TTTCAGCCGA GTGGGCGTAG 60 
GCGGCGCT 6 

(2) INFORMATION FOR SEQ ID NO: 18: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 68 base pairs 

(B) TYPES nucleic acid 

(C) STRANDEDNESS t single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 
(ix) FEATURES 

(A) NAME /KEYs Dhl 

(B) LOCATIONS 1...66 
(D) OTHER INFORMATIONS 

(xi) SEQUENCE DESCRIPTION s SEQ ID NOslBs 

^^£CCCT ACCCCCACTC CGCTCTTTTC ACCCGAGTOC GCGTAGGCGC CGCTCCGCCT 



(2) INFORMATION FOR SEQ ID NOil9s 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 baee pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ACCCCCAGCG CCGCCTACAC CCACTCGGCT GACCGG 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6B base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(ix) FEATURE: 

(A) NAME/KEY: SCI 

(B) LOCATION: 1...68 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NOs20: 
?S^ CC TGAGGAGAAG TCTCCTTTTG CAGACUUCUC CTCAGGAGUC AGGUGCGCGT 



(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(ix) FEATURE: 

(A) NAME /KEY: SC2 

(B) LOCATION: 1 . . . 68 
(D) OTHER INFORMATION: 
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(Xi) SEQUENCE DESCRIPTION! SEQ ID NOs21i 

ACCTGACTCC TGTGGAGAAG TCTCCTTTTC CAGACUUCUC CACACGACUC ACOUOCOCCT 60 
TTTCGCGC 68 

(2) INFORMATION FOR SEQ ID NO: 22 J 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(ix) FEATURES 

(A) NAME/KEY: SC3 

(B) LOCATION: 1...68 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ATCTGACTCC TGAGGAGAAG ACTGCTTTTG CAGUCUUCUC CTCAGGAGUC AGAUGCGCGT 
TTTCGCGC 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(ix) FEATURE: 

<A) NAME /KEY : SC4 
<B) LOCATION: 1. . .68 
(D) OTHER INFORMATION : 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

ACCTCACTCC TGAGGAGAAG ACTGCTTTTG CAGUCUUCUC CTCAGGAGUC AGGUGCGCGT 
TTTCGCGC 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 baBe pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(ix) FEATURE: 

(A) NAME /KEY : SC5 

(B) LOCATION: 1...68 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
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CCCCGTTTTC GCGCACCUGA CUCCTGTGGA GAAGUCUGCT TTTGCAGACT TCTCCACAGG 60 
AGTCAGGT 68 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH : 25 base patre 

(B) TYPE i nucleic acid 

(C) STRAND ED NESS t single 

(D) TOPOLOGY t linear 

(ii) MOLECULE TYPE i DNA 
<ix) FEATURE: 

(A) NAME /KEY: Delta 

(B) LOCATION: 1. ..25 
(D) OTHER INFORMATION: 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
ATCTCACTCC TCAGGAGAAG ACTGC 25 
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WHAT IS CLAIMED IS: 

1. An oligonucleobase compound for introducing an alteration 
in a gene of a eukaryotic cell, which comprises: 

a) a first strand of nucleobases having a 2' end and a 5* 
end, which first strand comprises: 

1) a total of at least 15 nucleobases; 

2) at least three nuclease resistant ribo-type 
nucleobases that are selected from the group 
consisting of 2 ■ AX-nucleosides , 2 ■ AX-nucleotoids # 
and 2 • AR-nucleot ides , wherein A=0, F, CI, or Br 
and, when A=0, then X=H or CV 6 alkane and R=C, b 
alkane, and when A*0, then X and R are omitted; 
and 

3) at least 3 contiguous ribo-type nucleobases, which 
can be the same as or in addition tc the nuclease 
resistant ribo-type nucleobases; and 

b) a second strand of nucleobases having a 2 ' end and a 
5' end, wherein the nucleobases of the second strand 
are Watson-Crick paired to the nucleobases of the 
first strand, 

in which the contiguous ribo-type nucleobases cf the first 
strand are Watson-Crick paired to 2 ' -deoxyribo-cype 
nucleobases and in which at least one ribo-type nucleobase is 
other than a 2'0-methyl substituted nucleotide. 

2. The compound of claim 1, wherein the first strand is a 
region of a oligonucleobase chain, which chain comprises a 3' 
and a 5 ' terminus . 

3. The compound of claim 2, in which the first strand 
comprises at least two segments of at least 3 contiguous 
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ribo-type nucleobases, which ribo-type nucleobases are 
Watson-Crick base paired to 2 ' -deoxyribo-type nucleobases. 

4. The compound of claim 3, in which each ribo-type 
nucleobase of the first strand is Watson-Crick paired to a 
deoxyribo- type nucleobase of the second strand. 

5. The compound of claim 4, in which each ribo-type 
nucleobase of the first strand is selected from the group 
consisting of 2 ' AX-nucleosides , 2 * AX-nucleotoids, and 2 'AR- 
nucleotides, wherein A=0, F, CI, or Br and, when A=0, then 
X^H or C : - alkane and R=C llfi alkane, and when A*0, then X and 
R are omitted. 

6. The compound of claim 5, which comprises a first chain 
having a first 3' terminus and a first 5' terminus, which 
first chain comprises the first strand, and a second chain, 
having a second 3 1 terminus and a second 5' terminus, which 
comprises the second strand. 

7. The compound of claim 5, in which for at least one ribo- 
type nucleobase A=F. 

8. The compound of claim 6, 7, in which the first and second 
chains are linked only by Watson-Crick paired nucleobases. 

9. The compound of claim 5, which further comprises a region 
having a sequence of at least 14 contiguous nucleobases or a 
pair of regions having sequences of at least 7 contiguous 
nucleobases wherein each said sequence is the sequence of a 
fragment of a mammalian gene . or of its complement. 
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10. The compound of claim 5, which further comprises a 
region having a sequence of at least 14 contiguous 
nucleobases or a pair of regions having sequences of at least 
7 nucleobases wherein each said sequence is the sequence of a 
fragment of a plant gene or of its complement. 

11. The compound of claim 5, in which the first strand 
comprises at least 9 ribo-type nucleobases. 

12. The compound of claim 5, in which the second strand 
contains no ribo-type nucleobases. 

13. The compound of claim 5, which contains no peptide 
nucleobases . 

14. The compound of claim 13, which comprises only 
pentosef uranosyl nucleobases. 

15. The compound of claim 14, which comprises one 
oligonucleobase chain. 

16. The compound of claim 15, which further comprises: 

a) a hairpin cap attached to the first strand or the 
second strand; and 

b) a linker having a first end and a second end, which 
first end is covalently attached the 3 1 end of the 
first strand and which second end is covalently 
attached to the 5' end of the second strand. 

17. The compound of claim 16, in which the 5' end of the 
first strand is the 5' terminus. 
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18. The compound of claim 16, in which the 3' end of the \ 
second strand is the 3' terminus. 1 

i 

t 

The compound- *of claim 15, which further comprises: 
a hairpin cap attached to the first strand or the 
second strand; and 

a linker having a first and a second end, which first 
end is covalently attached the 5' end of the first 
strand and which second end is covalently attached to 
the 2' end of the second strand. j 

! 

The compound of claim 19, in which the 3' end of the ' 
first strand is the 3' terminus. ' 

i 

21. The compound of claim 19, in which the 5' end of the ' 
second strand is the 5' terminus. 

22. The compound of claim 5, in which the first strand 
comprises a first ribo-segment consisting of at least 6 ribo- \ 
type nucleobases, a second ribo-segment consisting of at 

least 3 ribo-type nucleobases, and an intervening deoxyribo- 1 
segment consisting of at least 4 2 » -deoxyribo- type ! 
nucleobases. 



19, 



a) 



b) 



23. The compound of claim 5, which further comprises a 
protecting group that protects the 3' end of the first or 
second strand or the 5' end of the first or second strand. 

24. The compound of claim 5, in which the first strand 
comprises a nuclease resistant ribo-type nucleobase other 
than a 2 ' OMe-ribonucleotide . 
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25. The compound of claim 5, in which the second strand 
comprises a 2 ' -deoxyribo- type nucleoside or a peptide 
nucleobase . 

26. A method of isolating a eukaryotic cell having an 
alteration of a target sequence on a chromosome of the cell, 
which alteration causes a selectable phenotypic change of the 
cell, which comprises the steps of: 

a) providing an oligonucleobase which comprises: 

1) a first strand of nucleobases having a 3' end and 
a 5' end, which first strand comprises: 

i. a total of at least 15 nucleobases; 

ii. at least three nuclease resistant ribo- 
type nucleobases that are selected from 
the group consisting of 2 1 AX-nucleosides, 
2 ' AX-nucleotoids, and 2 ' AR-nucleotides , 
wherein A=0, F, CI, or Br and, when A=0, 
then X=H or C 5 6 alkane and R=C!. 6 alkane, 
and when A*0, then X and R are omitted; 
and 

iii. at least 3 contiguous ribo-type 
nucleobases, which can be the same as or 
in addition to the nuclease resistant 
ribo-type nucleobases, 

in which at least one ribo-type nucleobase is 
other than a 2'0-methyl substituted nucleotide; 
2) a second strand of nucleobases having a 3* end and 
a 5 1 end, 

wherein the nucleobases of the second strand are 
Watson-Crick paired to the nucleobases of the first 
strand, and wherein the contiguous ribo-type 
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nucleobases of the first strand are Watson-Crick 
paired to 2 ' -deoxyribo-type nucleobases, which 
oligonucleobase further comprises two homlogous 
regions that are each homologous with a region of the 
target sequence and a mutator region, disposed there 
between; 

b) maintaining said oligonucelobase within the nucleus of 
the cell; and 

c) selecting a cell having said selectable phenotypic 
change . 

27. The method of claim 26, wherein the mutator region is 6 
or fewer nucleobases. 



28. The method of claim 26, wherein the alteration is a 
deletion of from 6 to one nucleobase. 

29. The method of claim 26, wherein the homologous regions 
together comprise at least 9 and not more that 25 ribo-type 
nucleobases . 



30. A method of introducing an alteration in a target 
sequence on a chromosome of a cultured mammalian cell, which 
comprises the steps of: 

a) providing an oligonucleobase which comprises: 

1) a first strand of nucleobases having a 3' end and 
a 5' end, which first strand comprises: 

i- a total of at least 15 nucleobases; 

ii- a t least three nuclease resistant ribo- 
type nucleobases that are selected from 
the group consisting of 2 1 AX-nucleosides , 
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2 ' AX-nucleotoids , and 2 ' AR-nucleotides , 

wherein A=0, F, CI, or Br and, when A=0, 
0 

then X=H or C x . 6 alkane and R=C 16 alkane, 
and when A*0, then X and R are omitted; 
and 

iii. at least 3 contiguous ribo-type 

nucleobases, which can be the same as or 
in addition to the nuclease resistant 
ribo-type nucleobases; and 
2) a second strand of nucleobases having a 3 1 end and 

a 5 1 end, 

wherein the nucleobases of the second strand are 
Watson-Crick paired to the nucleobases of the first 
strand, and wherein the contiguous ribo-type 
nucleobases of the first strand are Watson-Crick 
paired to 2 ' -deoxyribo- type nucleobases, which 
oligonucleobase further comprises two homlogous 
regions that are each homologous with a region of the 
target sequence and a mutator region, disposed there 
between; and 

b) maintaining said oligonucleobase within the nucleus of 
the cell, 

whereby the alteration is introduced in the target sequence, 
provided that the alteration is other than an alteration that 
produces a selectable phenotype. 

31. The method of claim 30, wherein the mutator region is 
six or fewer nucleobases. 

32. The method of claim 30 , wherein the alteration is a 
deletion of from six to one nucleobases. 
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33. The method of claim 30, wherein the homologous regions 
together comprise at least 9 and not more that 25 ribo-type 
nucleobases . 

34. The method of claim 30, wherein^ the alteration repairs a 
disease-causing mutation in a human cell. 

35. The method of claim 30, wherein the alteration results 
in the inactivation of the gene that contains the target 
sequence. 

36. A method of introducing an alteration in a target 
sequence of the genome of a plant cell, which comprises the 
steps of : 

a) providing an oligonucleobase which comprises: 

1) a first strand of nucleobases having a 3' end and 
a 5' end, which first strand comprises: 

i. a total of at least 15 nucleobases; 

ii. at least three nuclease resistant ribo- 
type nucleobases that are selected from 
the group consisting of 2 • AX-nucleosides , 
2 ' AX-nucleotoids, and 2 * AR-nucleotides, 
wherein A=0, F, CI, or Br and, when A=0, 
then X=H or alkane and R^C^ alkane, 
and when A*0, then X and R are omitted; 
and 

iii. at least 3 contiguous ribo-type 
nucleobases, which can be the same as or 
in addition to the nuclease resistant 
ribo-type nucleobases; and 
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2) a second strand of nucleobases having a 3 1 end and 
a 5 1 end , 

wherein the nucleobases of the second strand are 
Watson-Crick paired to the nucleobases of the first 
strand, and wherein the contiguous ribo-type 
nucleobases of the first strand are Watson-Crick 
paired to 2 ' -deoxyribo- type nucleobases, which 
oligonucleobase further comprises two homologous 
regions that are each homologous with a region of the 
target sequence and a mutator region, disposed there 
between; and 

b) maintaining said oligonucleobase within the nucleus of 
the cell, 

whereby the alteration is introduced in the target sequence. 

37. The method of claim 36, wherein the mutator region is 
six or fewer nucleobases. 

38. The method of claim 36, wherein the alteration is a 
deletion of from six to one nucleobases. 

39. The method of claim 36, wherein the homologous regions 
together comprise at least 9 and not more that 25 ribo-type 
nucleobases. 
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